AITopics | Gandaki Province

Collaborating Authors

Gandaki Province

Parameter-efficient Fine-tuning for improved Convolutional Baseline for Brain Tumor Segmentation in Sub-Saharan Africa Adult Glioma Dataset

Adhikari, Bijay, Kulung, Pratibha, Bohaju, Jakesh, Poudel, Laxmi Kanta, Raymond, Confidence, Zhang, Dong, Anazodo, Udunna C, Khanal, Bishesh, Shakya, Mahesh

arXiv.org Artificial IntelligenceDec-18-2024

Automating brain tumor segmentation using deep learning methods is an ongoing challenge in medical imaging. Multiple lingering issues exist including domain-shift and applications in low-resource settings which brings a unique set of challenges including scarcity of data. As a step towards solving these specific problems, we propose Convolutional adapter-inspired Parameter-efficient Fine-tuning (PEFT) of MedNeXt architecture. To validate our idea, we show our method performs comparable to full fine-tuning with the added benefit of reduced training compute using BraTS-2021 as pre-training dataset and BraTS-Africa as the fine-tuning dataset. BraTS-Africa consists of a small dataset (60 train / 35 validation) from the Sub-Saharan African population with marked shift in the MRI quality compared to BraTS-2021 (1251 train samples). We first show that models trained on BraTS-2021 dataset do not generalize well to BraTS-Africa as shown by 20% reduction in mean dice on BraTS-Africa validation samples. Then, we show that PEFT can leverage both the BraTS-2021 and BraTS-Africa dataset to obtain mean dice of 0.8 compared to 0.72 when trained only on BraTS-Africa. Finally, We show that PEFT (0.80 mean dice) results in comparable performance to full fine-tuning (0.77 mean dice) which may show PEFT to be better on average but the boxplots show that full finetuning results is much lesser variance in performance. Nevertheless, on disaggregation of the dice metrics, we find that the model has tendency to oversegment as shown by high specificity (0.99) compared to relatively low sensitivity(0.75). The source code is available at https://github.com/CAMERA-MRI/SPARK2024/tree/main/PEFT_MedNeXt

artificial intelligence, deep learning, machine learning, (12 more...)

arXiv.org Artificial Intelligence

2412.141

Country:

Africa > Sub-Saharan Africa (0.41)
North America > Canada > Quebec > Montreal (0.14)
Africa > South Africa > Western Cape > Cape Town (0.04)
(6 more...)

Genre: Research Report > Experimental Study (0.46)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)
Health & Medicine > Nuclear Medicine (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

DAAL: Density-Aware Adaptive Line Margin Loss for Multi-Modal Deep Metric Learning

Gebrerufael, Hadush Hailu, Tiwari, Anil Kumar, Neupane, Gaurav, Hailu, Goitom Ybrah

arXiv.org Artificial IntelligenceNov-5-2024

Multi-modal deep metric learning is crucial for effectively capturing diverse representations in tasks such as face verification, fine-grained object recognition, and product search. Traditional approaches to metric learning, whether based on distance or margin metrics, primarily emphasize class separation, often overlooking the intra-class distribution essential for multi-modal feature learning. In this context, we propose a novel loss function called Density-Aware Adaptive Margin Loss(DAAL), which preserves the density distribution of embeddings while encouraging the formation of adaptive sub-clusters within each class. By employing an adaptive line strategy, DAAL not only enhances intra-class variance but also ensures robust inter-class separation, facilitating effective multi-modal representation. Comprehensive experiments on benchmark fine-grained datasets demonstrate the superior performance of DAAL, underscoring its potential in advancing retrieval applications and multi-modal deep metric learning.

learning, loss function, recognition, (13 more...)

arXiv.org Artificial Intelligence

2410.05438

Country:

North America > United States > Iowa (0.04)
Asia > Nepal > Gandaki Province > Kaski District > Pokhara (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
(5 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Mero Nagarikta: Advanced Nepali Citizenship Data Extractor with Deep Learning-Powered Text Detection and OCR

Dhakal, Sisir, Sigdel, Sujan, Paudel, Sandesh Prasad, Ranabhat, Sharad Kumar, Lamichhane, Nabin

arXiv.org Artificial IntelligenceOct-8-2024

Transforming text-based identity documents, such as Nepali citizenship cards, into a structured digital format poses several challenges due to the distinct characteristics of the Nepali script and minor variations in print alignment and contrast across different cards. This work proposes a robust system using YOLOv8 for accurate text object detection and an OCR algorithm based on Optimized PyTesseract. The system, implemented within the context of a mobile application, allows for the automated extraction of important textual information from both the front and the back side of Nepali citizenship cards, including names, citizenship numbers, and dates of birth. The final YOLOv8 model was accurate, with a mean average precision of 99.1% for text detection on the front and 96.1% on the back. The tested PyTesseract optimized for Nepali characters outperformed the standard OCR regarding flexibility and accuracy, extracting text from images with clean and noisy backgrounds and various contrasts. Using preprocessing steps such as converting the images into grayscale, removing noise from the images, and detecting edges further improved the system's OCR accuracy, even for low-quality photos. This work expands the current body of research in multilingual OCR and document analysis, especially for low-resource languages such as Nepali. It emphasizes the effectiveness of combining the latest object detection framework with OCR models that have been fine-tuned for practical applications.

citizenship card, dataset, text detection, (13 more...)

arXiv.org Artificial Intelligence

2410.05721

Country:

Europe > Switzerland (0.04)
Asia > Nepal > Gandaki Province > Kaski District > Pokhara (0.04)
Asia > India (0.04)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.84)

Add feedback

The Nexus of AR/VR, Large Language Models, UI/UX, and Robotics Technologies in Enhancing Learning and Social Interaction for Children: A Systematic Review

Paneru, Biplov, Paneru, Bishwash

arXiv.org Artificial IntelligenceSep-26-2024

The combination of large language models (LLMs), augmented reality (AR), and user interface/user experience (UI/UX) design in therapies for children, especially with disorders like autism spectrum disorder (ASD), is examined in this review study. Three primary areas are covered in this review: how AR can improve social and learning results; how LLMs can help with communication; and how UI/UX design affects how effective these technologies are. Results reveal that while LLMs can provide individualized learning and communication support, AR has demonstrated promise in enhancing social skills, motivation, and attention. For children with ASD, accessible and interesting interventions depend heavily on effective UI/UX design. To optimize the benefits of these technologies in ASD therapies, the study emphasizes the need for additional research to address difficulties related to customization, accessibility, and integration. Keywords: Autism Spectrum Disorder, Large Language Models (LLM), Augmented Reality (AR), Virtual Reality (VR) 1. Introduction Children with autism can benefit greatly from digitally assisted language therapies thanks to augmented reality (AR). Numerous results and insights about the use of augmented reality (AR) as a teaching and pedagogical aid have been reported by educators and researchers [1]. The use of computer technology--particularly augmented reality--in autism spectrum disorder (ASD) therapies has grown as a means of treating or mitigating the symptoms of the disorder. Not just for kids of a certain age or educational level, augmented reality is an entertaining form of technology that facilitates easy interaction and helps kids comprehend and retain information [2]. A neurodevelopmental disorder known as autism spectrum disorder (ASD) is marked by recurring problems with social interaction and communication, as well as a limitation in interests and repetitive activities [3]. It is believed that one in every 100 youngsters worldwide is affected by ASD.

artificial intelligence, large language model, natural language, (13 more...)

arXiv.org Artificial Intelligence

2409.18162

Country:

Europe > Switzerland > Basel-City > Basel (0.04)
Asia > China (0.04)
South America > Colombia > Atlántico Department > Barranquilla (0.04)
(2 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Overview (1.00)

Industry: Health & Medicine > Therapeutic Area > Neurology > Autism (1.00)

Technology:

Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Analysis of Convolutional Neural Network-based Image Classifications: A Multi-Featured Application for Rice Leaf Disease Prediction and Recommendations for Farmers

Paneru, Biplov, Paneru, Bishwash, Shah, Krishna Bikram

arXiv.org Artificial IntelligenceSep-17-2024

This study presents a novel method for improving rice disease classification using 8 different convolutional neural network (CNN) algorithms, which will further the field of precision agriculture. Tkinter-based application that offers farmers a feature-rich interface. With the help of this cutting-edge application, farmers will be able to make timely and well-informed decisions by enabling real-time disease prediction and providing personalized recommendations. Together with the user-friendly Tkinter interface, the smooth integration of cutting-edge CNN transfer learning algorithms-based technology that include ResNet-50, InceptionV3, VGG16, and MobileNetv2 with the UCI dataset represents a major advancement toward modernizing agricultural practices and guaranteeing sustainable crop management. Remarkable outcomes include 75% accuracy for ResNet-50, 90% accuracy for DenseNet121, 84% accuracy for VGG16, 95.83% accuracy for MobileNetV2, 91.61% accuracy for DenseNet169, and 86% accuracy for InceptionV3. These results give a concise summary of the models' capabilities, assisting researchers in choosing appropriate strategies for precise and successful rice crop disease identification. A severe overfitting has been seen on VGG19 with 70% accuracy and Nasnet with 80.02% accuracy. On Renset101, only an accuracy of 54% could be achieved, along with only 33% on efficientNetB0. A MobileNetV2-trained model was successfully deployed on a TKinter GUI application to make predictions using image or real-time video capture.

accuracy, artificial intelligence, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2410.01827

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
Asia > Nepal > Gandaki Province > Kaski District > Pokhara (0.05)
Europe > Switzerland > Basel-City > Basel (0.04)
(3 more...)

Genre:

Overview > Innovation (0.48)
Research Report > Promising Solution (0.48)

Industry: Food & Agriculture > Agriculture (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

WaterQualityNeT: Prediction of Seasonal Water Quality of Nepal Using Hybrid Deep Learning Models

Paneru, Biplov, Paneru, Bishwash

arXiv.org Artificial IntelligenceSep-17-2024

Ensuring a safe and uncontaminated water supply is contingent upon the monitoring of water quality, especially in developing countries such as Nepal, where water sources are susceptible to pollution. This paper presents a hybrid deep learning model for predicting Nepal's seasonal water quality using a small dataset with many water quality parameters. The model integrates convolutional neural networks (CNN) and recurrent neural networks (RNN) to exploit temporal and spatial patterns in the data. The results demonstrate significant improvements in forecast accuracy over traditional methods, providing a reliable tool for proactive control of water quality. The model that used WQI parameters to classify people into good, poor, and average groups performed 92% of the time in testing. Similarly, the R2 score was 0.97 and the root mean square error was 2.87 when predicting WQI values using regression analysis. Additionally, a multifunctional application that uses both a regression and a classification approach is built to predict WQI values.

accuracy, prediction, water quality, (13 more...)

arXiv.org Artificial Intelligence

2409.10898

Country:

North America > United States (0.46)
Asia > Nepal > Bagmati Province > Kathmandu District > Kathmandu (0.05)
North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.04)
(3 more...)

Genre: Research Report > New Finding (1.00)

Industry: Water & Waste Management > Water Management > Water Supplies & Services (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

EEG Right & Left Voluntary Hand Movement-based Virtual Brain-Computer Interfacing Keyboard with Machine Learning and a Hybrid Bi-Directional LSTM-GRU Model

Paneru, Biplov, Paneru, Bishwash, Sapkota, Sanjog Chhetri

arXiv.org Artificial IntelligenceAug-17-2024

This study focuses on EEG-based BMI for detecting voluntary keystrokes, aiming to develop a reliable brain-computer interface (BCI) to simulate and anticipate keystrokes, especially for individuals with motor impairments. The methodology includes extensive segmentation, event alignment, ERP plot analysis, and signal analysis. Different deep learning models are trained to classify EEG data into three categories -- `resting state' (0), `d' key press (1), and `l' key press (2). Real-time keypress simulation based on neural activity is enabled through integration with a tkinter-based graphical user interface. Feature engineering utilized ERP windows, and the SVC model achieved 90.42% accuracy in event classification. Additionally, deep learning models -- MLP (89% accuracy), Catboost (87.39% accuracy), KNN (72.59%), Gaussian Naive Bayes (79.21%), Logistic Regression (90.81% accuracy), and a novel Bi-Directional LSTM-GRU hybrid model (89% accuracy) -- were developed for BCI keyboard simulation. Finally, a GUI was created to predict and simulate keystrokes using the trained MLP model.

accuracy, dataset, virtual keyboard, (12 more...)

arXiv.org Artificial Intelligence

2409.00035

Country:

Europe > Switzerland > Basel-City > Basel (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
North America > Canada (0.04)
(3 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.89)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Health Care Technology (0.93)
Health & Medicine > Therapeutic Area > Musculoskeletal (0.68)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.66)

Add feedback

Evaluating Transfer Learning in Deep Learning Models for Classification on a Custom Wildlife Dataset: Can YOLOv8 Surpass Other Architectures?

Sharma, Subek, Dhakal, Sisir, Bhavsar, Mansi

arXiv.org Artificial IntelligenceJul-10-2024

Biodiversity plays a crucial role in maintaining the balance of the ecosystem. However, poaching and unintentional human activities contribute to the decline in the population of many species. Hence, active monitoring is required to preserve these endangered species. Current human-led monitoring techniques are prone to errors and are labor-intensive. Therefore, we study the application of deep learning methods like Convolutional Neural Networks (CNNs) and transfer learning, which can aid in automating the process of monitoring endangered species. For this, we create our custom dataset utilizing trustworthy online databases like iNaturalist and ZooChat. To choose the best model for our use case, we compare the performance of different architectures like DenseNet, ResNet, VGGNet, and YOLOv8 on the custom wildlife dataset. Transfer learning reduces training time by freezing the pre-trained weights and replacing only the output layer with custom, fully connected layers designed for our dataset. Our results indicate that YOLOv8 performs better, achieving a training accuracy of 97.39 % and an F1 score of 96.50 %, surpassing other models. Our findings suggest that integrating YOLOv8 into conservation efforts could revolutionize wildlife monitoring with its high accuracy and efficiency, potentially transforming how endangered species are monitored and protected worldwide.

accuracy, dataset, val, (14 more...)

arXiv.org Artificial Intelligence

2408.00002

Country:

North America > United States > Minnesota (0.04)
Asia > Nepal > Gandaki Province > Kaski District > Pokhara (0.04)
Asia > Nepal > Bagmati Province > Kathmandu District > Kathmandu (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Law > Environmental Law (0.75)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Tracing the Roots of Facts in Multilingual Language Models: Independent, Shared, and Transferred Knowledge

Zhao, Xin, Yoshinaga, Naoki, Oba, Daisuke

arXiv.org Artificial IntelligenceMar-8-2024

Acquiring factual knowledge for language models (LMs) in low-resource languages poses a serious challenge, thus resorting to cross-lingual transfer in multilingual LMs (ML-LMs). In this study, we ask how ML-LMs acquire and represent factual knowledge. Using the multilingual factual knowledge probing dataset, mLAMA, we first conducted a neuron investigation of ML-LMs (specifically, multilingual BERT). We then traced the roots of facts back to the knowledge source (Wikipedia) to identify the ways in which ML-LMs acquire specific facts. We finally identified three patterns of acquiring and representing facts in ML-LMs: language-independent, cross-lingual shared and transferred, and devised methods for differentiating them. Our findings highlight the challenge of maintaining consistent factual knowledge across languages, underscoring the need for better fact representation learning in ML-LMs.

computational linguistic, knowledge, representation, (15 more...)

arXiv.org Artificial Intelligence

2403.05189

Country:

Europe > Denmark (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.05)
(32 more...)

Genre: Research Report > New Finding (1.00)

Industry: Leisure & Entertainment (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.50)

Add feedback

Best Images Generated by Bing Image Creator - Web3suggest.com

#artificialintelligenceApr-1-2023, 07:45:53 GMT

Bing Image Creator is an innovative tool that allows users to generate stunning images with ease. This tool has a vast database of images and allows users to create images in different styles, including cartoons, 3D images, and optical illusions. As a user of this tool, I must say that I am impressed with the results that I have seen. I first used Bing Image Creator to generate some cartoon characters, and I was delighted with some of the ones it created. The tool allowed me to choose different features for the cartoon characters, such as the size of the eyes, the shape of the nose, and the style of the hair.

bing image creator, creator, image creator, (13 more...)

#artificialintelligence

Country:

Asia > Nepal > Gandaki Province > Kaski District > Pokhara (0.05)
Asia > India (0.05)

Industry: Media > News (0.35)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.32)

Add feedback